Consistent Biclustering and Applications to Agriculture
نویسندگان
چکیده
Consistent biclusterings of training sets can be exploited for solving classification problems in data mining. This technique has been mainly applied so far to solve classification problems related to gene expression data. However, it can be successfully applied to problems arising in other domains, and it is also able to provide information on the features causing the classification of the training set. We provide a quick overview of this technique, and we present a study on a particular problem arising in the agricultural field. We consider the problem of predicting problematic fermentations of wine at early stages of the vinification. The presented computational experiments show that the considered technique is able to provide some clues on the possible features causing the problematic fermentations.
منابع مشابه
Extending the definition of beta-consistent biclustering for feature selection
Consistent biclusterings of sets of data are useful for solving feature selection and classification problems. The problem of finding a consistent biclustering can be formulated as a combinatorial optimization problem, and it can be solved by the employment of a recently proposed VNS-based heuristic. In this context, the concept of β-consistent biclustering has been introduced for dealing with ...
متن کاملModels and Issues in Consistent Biclustering
Biclustering is a methodology allowing simultaneous partitioning of a set of samples and their features into classes. Samples and features classified together are supposed to have a high relevance to each other which can be observed by intensity of their expressions. The notion of consistency for biclustering is defined using interrelation between centroids of sample and feature classes. Consis...
متن کاملFuzzy Adaptive Resonance Theory, Diffusion Maps and their applications to Clustering and Biclustering
In this paper, we describe an algorithm FARDiff (Fuzzy Adaptive Resonance Diffusion) which combines Diffusion Maps and Fuzzy Adaptive Resonance Theory to do clustering and biclustering on high dimensional data. We describe some applications of this method.
متن کاملBiclustering in data mining
Biclustering consists in simultaneous partitioning of the set of samples and the set of their attributes (features) into subsets (classes). Samples and features classified together are supposed to have a high relevance to each other. In this paper we review the most widely used and successful biclustering techniques and their related applications. This survey is written from a theoretical viewp...
متن کاملA structured view on pattern mining-based biclustering
Mining matrices to find relevant biclusters, subsets of rows exhibiting a coherent pattern over a subset of columns, is a critical task for a wide-set of biomedical and social applications. Since biclustering is a challenging combinatorial optimization task, existing approaches place restrictions on the allowed structure, coherence and quality of biclusters. Biclustering approaches relying on p...
متن کامل